A New Criterion in Selection and Discretization of Attributes for the Generation of Decision Trees
نویسندگان
چکیده
It is important to use a better criterion in selection and discretization of attributes for the generation of decision trees to construct a better classifier in the area of pattern recognition in order to intelligently access huge amount of data efficiently. Two well-known criteria are gain and gain ratio, both based on the entropy of partitions. We propose in this paper a new criterion based also on entropy, and use both theoretical analysis and computer simulation to demonstrate that it works better than gain or gain ratio in a wide variety of situations. We use the usual entropy calculation where the base of the logarithm is not two but the number of successors to the node. Our theoretical analysis leads some specific situations in which the new criterion works always better than gain or gain ratio, and the simulation result may implicitly cover all the other situations not covered by the analysis.
منابع مشابه
A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملA Fuzzy Compromise Programming Solution for Supplier Selection in Quantity Discounts Situation
Supplier selection, Multi-objective decision making, Fuzzy Compromise programming, Supply chain management, Quantity discount . Supplier selection is naturally a complex multi-objective problem including both quantitative and qualitative factors. This paper deals with this issue from a new view point. A quantity discount situation, which plays a role of motivator for buyer, is con...
متن کاملA new evaluation model for selecting a qualified manager by using fuzzy Topsis approach
Considering the contemporary business settings, managers’ role is more than essential to the viability and further development of an organization. Managers should possess such skills in order to effectively cope with the competition. Multiple attributes decision making (MADM) is an approach employed to solve problems involving selection from among a finite number of alternatives. The aim of thi...
متن کاملMulti-Interval Discretization of Continuous-Valued Attributes for Classification Learning
Since most real-world applications of classification learning involve continuous-valued attributes, properly addressing the discretization process is an important problem. This paper addresses the use of the entropy minimization heuristic for discretizing the range of a continuous-valued attribute into multiple intervals. We briefly present theoretical evidence for the appropriateness of this h...
متن کاملResilient Supplier Selection in a Supply Chain by a New Interval-Valued Fuzzy Group Decision Model Based on Possibilistic Statistical Concepts
Supplier selection is one the main concern in the context of supply chain networks by considering their global and competitive features. Resilient supplier selection as generally new idea has not been addressed properly in the literature under uncertain conditions. Therefore, in this paper, a new multi-criteria group decision-making (MCGDM) model is introduced with interval-valued fuzzy sets (I...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Pattern Anal. Mach. Intell.
دوره 19 شماره
صفحات -
تاریخ انتشار 1997